Search CORE

University of Toronto Research Repository

Cluster analysis of protein array results via similarity of Gene Ontology annotation

Author: AL Edinger
B Cox
B Zhang
BR Zeeberg
C Jane McGlade
CG Proud
CH Wu
Cheryl Wolting
David Tritchler
DL Wheeler
GR Mishra
GW Milligan
H Hu
H Rebholz
H Zhu
J Huang
JE Hirschman
KR Christie
L Kaufman
M Ashburner
MD Robinson
N Kaplan
N Kaplan
P Khatri
P Uetz
PW Lord
RC Gentleman
S Dudoit
SF Barnett
T Kislinger
V Kunin
X Jiang
Y Ho
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: With the advent of high-throughput proteomic experiments such as arrays of purified proteins comes the need to analyse sets of proteins as an ensemble, as opposed to the traditional one-protein-at-a-time approach. Although there are several publicly available tools that facilitate the analysis of protein sets, they do not display integrated results in an easily-interpreted image or do not allow the user to specify the proteins to be analysed. RESULTS: We developed a novel computational approach to analyse the annotation of sets of molecules. As proof of principle, we analysed two sets of proteins identified in published protein array screens. The distance between any two proteins was measured as the graph similarity between their Gene Ontology (GO) annotations. These distances were then clustered to highlight subsets of proteins sharing related GO annotation. In the first set of proteins found to bind small molecule inhibitors of rapamycin, we identified three subsets containing four or five proteins each that may help to elucidate how rapamycin affects cell growth whereas the original authors chose only one novel protein from the array results for further study. In a set of phosphoinositide-binding proteins, we identified subsets of proteins associated with different intracellular structures that were not highlighted by the analysis performed in the original publication. CONCLUSION: By determining the distances between annotations, our methodology reveals trends and enrichment of proteins of particular functions within high-throughput datasets at a higher sensitivity than perusal of end-point annotations. In an era of increasingly complex datasets, such tools will help in the formulation of new, testable hypotheses from high-throughput experimental data

Springer - Publisher Connector

Maastricht University Research Portal

The Biomolecular Interaction Network Database and related tools 2005 update

Author: Alfarano C.
Andrade C. E.
Anthony K.
Bahroos N.
Bajec M.
Bantoft K.
Betel D.
Bobechko B.
Boutilier K.
Burgess E.
Buzadzija K.
Cavero R.
D'Abreo C.
Donaldson I.
Dorairajoo D.
Dumontier M. J.
Dumontier M. R.
Earles V.
Farrall R.
Feldman H.
Garderman E.
Gong Y.
Gonzaga R.
Grytsan V.
Gryz E.
Gu V.
Haldorsen E.
Halupa A.
Haw R.
Hogue C. W. V.
Hrvojic A.
Hurrell L.
Isserlin R.
Jack F.
Juma F.
Khan A.
Kon T.
Konopinsky S.
Le V.
Lee E.
Ling S.
Magidin M.
Moniakis J.
Montojo J.
Moore S.
Muskat B.
Ng I.
Ouellette B. F. F.
Paraiso J. P.
Parker B.
Pawson T.
Pintilie G.
Pirone R.
Salama J. J.
Sgro S.
Shan T.
Shu Y.
Siew J.
Skinner D.
Snyder K.
Stasiuk R.
Strumpf D.
Tao S.
Tuekam B.
Wang Z.
White M.
Willis R.
Wolting C.
Wong S.
Wrong A.
Xin C.
Yao R.
Yates B.
Zhang S.
Zheng K.
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

The Biomolecular Interaction Network Database (BIND) (http://bind.ca) archives biomolecular interaction, reaction, complex and pathway information. Our aim is to curate the details about molecular interactions that arise from published experimental research and to provide this information, as well as tools to enable data analysis, freely to researchers worldwide. BIND data are curated into a comprehensive machine-readable archive of computable information and provides users with methods to discover interactions and molecular mechanisms. BIND has worked to develop new methods for visualization that amplify the underlying annotation of genes and proteins to facilitate the study of molecular interaction networks. BIND has maintained an open database policy since its inception in 1999. Data growth has proceeded at a tremendous rate, approaching over 100 000 records. New services provided include a new BIND Query and Submission interface, a Standard Object Access Protocol service and the Small Molecule Interaction Database (http://smid.blueprint.org) that allows users to determine probable small molecule binding sites of new sequences and examine conserved binding residues

ScholarBank@NUS

Multiconstrained gene clustering based on generalized projections

Author: A Schlicker
A Schliep
Alan Wee-Chung Liew
B Adryan
C Wolting
D Dembélé
D Hanisch
D Huang
D Tritchler
DM Blei
E Kreyszig
H Stark
Hong Yan
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
J Zeng
Jia Zeng
JL Sevilla
JZ Wang
L Tari
M Aubry
M Kanehisa
M Shiga
MB Eisen
MF Ramoni
MK Kerr
N Bolshakova
P Tamayo
PT Spellman
PW Lord
R Steuer
S Tavazoie
S Zhu
S Zhu
Shanfeng Zhu
TR Hughes
W Feng
W Pan
X Gan
X Guo
XQ Cao
XQ Cao
XQ Cao
Z Bar-Joseph
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Gene clustering for annotating gene functions is one of the fundamental issues in bioinformatics. The best clustering solution is often regularized by multiple constraints such as gene expressions, Gene Ontology (GO) annotations and gene network structures. How to integrate multiple pieces of constraints for an optimal clustering solution still remains an unsolved problem. Results We propose a novel multiconstrained gene clustering (MGC) method within the generalized projection onto convex sets (POCS) framework used widely in image reconstruction. Each constraint is formulated as a corresponding set. The generalized projector iteratively projects the clustering solution onto these sets in order to find a consistent solution included in the intersection set that satisfies all constraints. Compared with previous MGC methods, POCS can integrate multiple constraints from different nature without distorting the original constraints. To evaluate the clustering solution, we also propose a new performance measure referred to as Gene Log Likelihood (GLL) that considers genes having more than one function and hence in more than one cluster. Comparative experimental results show that our POCS-based gene clustering method outperforms current state-of-the-art MGC methods. Conclusions The POCS-based MGC method can successfully combine multiple constraints from different nature for gene clustering. Also, the proposed GLL is an effective performance measure for the soft clustering solutions.</p

Springer - Publisher Connector

A transversal approach to predict gene product networks from ontology-based similarity

Author: A Budanitsky
A Schlicker
A Singhal
Anita Burgun
C Wolting
D Lin
DS Harris
E Agirre
E Camon
E Levy
EB Camon
F Azuaje
FD Gibbons
FJ Field
G Rigau
G Salton
GO Consortium
H Bedrine-Ferran
H Sun
H Wang
IG Wool
J Chabalier
J Chabalier
J Jiang
Jean Mosser
JH Chiang
JM Mariadason
Julie Chabalier
M Gerstein
M Kanehisa
MB Eisen
MD Weiss
ME Brosnan
O Bodenreider
P Joseph
P Khatri
P Resnik
PW Lord
R Baeza-Yates
R Rada
RC Gentleman
T Barrett
T Nakajima
T Yamamoto
TK Jenssen
X Mao
Y Quentin
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Interpretation of transcriptomic data is usually made through a "standard" approach which consists in clustering the genes according to their expression patterns and exploiting Gene Ontology (GO) annotations within each expression cluster. This approach makes it difficult to underline functional relationships between gene products that belong to different expression clusters. To address this issue, we propose a transversal analysis that aims to predict functional networks based on a combination of GO processes and data expression. Results The transversal approach presented in this paper consists in computing the semantic similarity between gene products in a Vector Space Model. Through a weighting scheme over the annotations, we take into account the representativity of the terms that annotate a gene product. Comparing annotation vectors results in a matrix of gene product similarities. Combined with expression data, the matrix is displayed as a set of functional gene networks. The transversal approach was applied to 186 genes related to the enterocyte differentiation stages. This approach resulted in 18 functional networks proved to be biologically relevant. These results were compared with those obtained through a standard approach and with an approach based on information content similarity. Conclusion Complementary to the standard approach, the transversal approach offers new insight into the cellular mechanisms and reveals new research hypotheses by combining gene product networks based on semantic similarity, and data expression.</p

Springer - Publisher Connector

Biochemical and Computational Analysis Of LNX1 Interacting Proteins

Author: Brittany C. Prevost
C. Jane Mcglade
Cheryl D. Wolting
Emily K. Griffiths
Leanne E. Wybenga-groot
Renu Sarao
Publication venue
Publication date: 08/11/2011
Field of study

PDZ (Post-synaptic density, 95 kDa, Discs large, Zona Occludens-1) domains are protein interaction domains that bind to the carboxy-terminal amino acids of binding partners, heterodimerize with other PDZ domains, and also bind phosphoinositides. PDZ domain containing proteins are frequently involved in the assembly of multi-protein complexes and clustering of transmembrane proteins. LNX1 (Ligand of Numb, protein X 1) is a RING (Really Interesting New Gene) domain-containing E3 ubiquitin ligase that also includes four PDZ domains suggesting it functions as a scaffold for a multiprotein complex. Here we use a human protein array to identify direct LNX1 PDZ domain binding partners. Screening of 8,000 human proteins with isolated PDZ domains identified 53 potential LNX1 binding partners. We combined this set with LNX1 interacting proteins identified by other methods to assemble a list of 220 LNX1 interacting proteins. Bioinformatic analysis of this protein list was used to select interactions of interest for future studies. Using this approach we identify and confirm six novel LNX1 binding partners: KCNA4, PAK6, PLEKHG5, PKC-alpha1, TYK2 and PBK, and suggest that LNX

CiteSeerX